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Abstract 

This  paper  gives  a brief  Introduction  to  trans- 
ition networks  and  proposes  an  approach  to  the  In- 
ference of  transition  network  grammars. 

I.  Introduction 

In  order  to  model  a language  more  realistically. 
It  Is  desirable  that  the  grammar  used  can  be  directly 
Inferred  from  the  set  of  sample  sentences.  This  pro- 
blem of  learning  a grammar  based  on  a set  of  sample 
sentences  is  called  grammatical  inference.  The  appli- 
cations of  grammatical  Inference  include  areas  of 
pattern  recognition.  Information  retrieval,  artificial 
Intelligence,  and  translation  and  compiling  of  pro- 
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gramming  languages  * . 

A unique  relationship  between  a language  and  a 
grammar  does  not  exist.  Quite  often  different  , 
grammars  generate  the  same  language.  By  definition3, 
two  grammars  are  equivalent  if  and  only  if  they  both 
generate  the  same  language.  It  Is  possible  to  tell 

If  two  finite  state  grammars  are  equivalent^.  How- 
ever, for  two  grammars  of  types  other  than  finite 
state,  there  Is  no  way  of  telling  their  equivalence. 
Thus,  except  for  finite  state  grammars,  the  inference 
problem  does  not  have  a unique  solution  unless  addi- 
tional constraints  are  placed  upon  the  grammar  being 

Inferred*1.  One  of  the  constraints  may  be  to  select  a 
grammar  of  minimum  complexity^. 

In  the  process  of  Inference,  a set  of  sentences 
that  are  known  to  be  In  the  language  must  be  given. 
This  set  Is  called  a positive  sample  of  the  language. 
There  may  be  another  set  of  sentences  given  called  a 
negative  sample  of  the  language  that  are  known  not 
to  be  In  the  language.  A positive  sample  of  a 
language  l(G)  is  said  to  be  structurally  complete  If 
each  rewriting  rule  of  G is  used  in  the  generation  of 
a nonempty  subset  of  the  sample.  In  general,  as- 
sumptions are  made  for  all  existing  Inference  tech- 
niques as  follows: 

1.  The  type  of  the  granmar  being  Inferred  Is 
specified, 

2.  The  given  sample  of  the  language  is  finite, 

3.  The  given  positive  sample  of  the  language  is 
structurally  complete, 

4.  The  Inferred  grammar  G Is  such  that  S*Cl(C) 

and  S*  c L(G),  where  S+  and  s"  are  positive 
and  negative  samples  of  the  language,  re- 
spectively; and  L(G)  Is  the  complement  of  the 
language  1(G). 

^Thls  work  was  supported  by  the  AFOSR  Grant  74-2661. 

*S.  N.  Chou  Is  presently  with  the  Automation  and  Con- 
trol Laboratory,  General  Electric  Corporate  Research 
and  Development,  Schenectady,  New  York  12301. 
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A survey  of  liteij^ure  In  thfc  area  of  grammatical 

Inference  can  be  found  In  In  this  paper,  an 

Introduction  to  transition  networks  will  first  be 
given.  Then  following  a brief  review  of  the  problem, 
an  approach  to  the  Inference  of  transition  networks 
will  be  proposed. 

II.  Transition  Network  Grammars 

The  transition  network  grammar  has  been  developed 
as  a model  of  natural  language  analysis^  ***. 

A basic  transition  network  (BTN)  Is  a directed 
graph  with  labeled  states  and  arcs,  a distinguished 
state  called  the  start  state  and  a distinguished  set 
of  states  called  final  states.  It  looks  essentially 
like  a nondeterministlc  finite  state  transition  dia- 
gram, except  that  the  labels  on  the  arcs  may  be  state 
names  as  well  as  terminal  symbols.  The  Interpretation 
of  an  arc  with  a state  name  as  its  label  is  that  the 
state  at  the  end  of  the  arc  will  be  saved  on  a push- 
down store  and  the  control  will  jump  (without  advancing 
the  Input  pointer)  to  the  state  that  is  the  arc  label. 
When  a final  state  Is  encountered,  the  pushdown  store 
may  be  "popped"  by  transferring  control  to  the  state 
which  is  named  on  the  top  of  the  stack.  An  attempt  to 
pop  an  empty  stack  when  the  last  input  symbol  has  just 
been  processed  Is  the  criterion  for  acceptance  of  the 
Input  string. 

The  TN  described  above  is  a generalized  pushdown 
automaton  and  Is  equivalent  to  a context-free  grammar. 
However,  a TN  could  be  augmented  Into  a more  powerful 
machine  by  adding  facilities  to  each  arc.  These  in- 
clude arbitrary  conditions  which  must  be  satisfied  In 
order  for  the  arc  to  be  followed  and  a set  of  register- 
setting  actions  to  be  executed  If  the  arc  Is  followed. 
The  power  of  an  augmented  transition  network  (ATN)  Is 
determined  by  the  facilities  added  to  the  arcs.  With 
certain  restrictions  of  the  arcs,  the  power  of  the  ATN 
can  be  modified  for  any  kind  of  applications  needed. 

A TN  can  be  described  as  a generalized  pushdown 
machine  consisting  of  a finite  set  of  finite-state 
machines  and  a finite  set  of  pushdown  stores.  Formal- 
ly, a TN  can  be  defined  as  a 6-tuple. 

TN  - (E,  Q,  A,  Qq,  Qf,  q0),  where  E Is  a finite 

set  of  Input  symbols,  Q Is  a finite  set  of  states, 

Qg  C Q Is  the  set  of  Initial  states  of  the  finite- 

state  machines,  is  the  set  of  final  states  of 

the  finite-state  machines,  qQ  c Qq  Is  the  Initial 

state  of  the  TN,  A Is  a finite  set  of  arcs.  Associ- 
ated with  each  state  there  are  several  arcs  for  trans- 
itions and  actions.  The  arcs  can  be  categorized  into 
five  classes: 

I.  CAT  arc:  (CAT  C).  A transition  1s  made  from 

the  present  state  to  the  s.ste  at  the  end  of  the  arc 
consuming  an  Input  symbol  which  Is  In  the  syntactic 
class  labeled  on  the  arc.  The  consumed  symbol  may  be 
saved  In  a hold  list  when  a HOLD  action  is  required 
on  the  arc.  This  Is  done  for  future  tests  of  context 
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relationship. 

2.  POSH  arcs  (PUSH  q0'),  q0'  c Qg.  The  desti- 
nation state  of  the  arc  Is  saved  In  the  pushdown  store 
and  the  state  Is  transferred  to  the  state  shown  on  the 
arc  which  Is  the  Initial  state  of  a finite-state 
■achlne. 

3.  POP  arc:  (POP).  The  state  Is  transferred  to 

the  state  shown  on  the  top  of  the  pushdown  store.  And 
the  stack  Is  popped  one  element  up. 

k.  VIR  arc:  (VIRC).  A transition  from  the 

present  state  to  the  state  at  the  end  of  the  arc  Is 
made  by  testing  for  the  symbol  shown  on  the  VIR  arc 
In  the  hold  list. 

5.  JUMP  arc:  (JUMP).  A transition  from  the 

present  state  to  the  state  at  the  end  of  the  arc  is 
made  If  the  conditions  specified  on  the  arc  are  satis- 
fied. The  transition  does  not  consume  any  of  the  in- 
put string.  This  Is  a means  of  making  a transition 
from  one  state  to  another  without  advancing  the  input 
pointer. 

The  Input  string  is  accepted  when  the  TH  is 
popping  the  empty  pushdown  store  with  empty  hold  list. 
The  language  accepted  by  a TN  is  denoted  as  L(TN). 

It  Is  Interesting  to  compare  the  acceptors  of 
phrase  structure  grammars  with  the  transition  networks. 
Mote  that  a TH  consists  of  a finite  set  of  finite-state 
machines  and  pushdown  stores.  Consider  a TN  with  only 
one  network  which  is  a set  of  states  with  CAT  and  POP 
arcs.  This  appears  exactly  to  be  a finite-state  au- 
tomaton which  accepts  type  3 languages.  Furthermore, 
consider  the  BTN  which  is  a set  of  finite-state 
machines  and  a pushdown  store.  The  BTN  is  equivalent 
to  an  acceptor  of  a context-free  language.  With  the 
aid  of  register-setting  actions  on  the  arc  and  the 
checking  action  of  the  VIR  arcs,  the  ATN  could  achieve 
the  power  of  a Turing  machine.  Suppose  that  there  Is 
a bound  on  the  summing  size  of  all  the  stores  of  the 
ATN  such  that  the  size  is  less  or  equal  to  the  length 
of  the  part  of  the  input  string  not  yet  scanned  by  the 
Input  pointer.  Then  It  appears  to  be  like  a linear 
bounded  automaton  which  accepts  context-sensitive 
languages.  It  is  then  clear  that  an  ATN  is  equivalent 
to  a Turing  machine.  All  the  acceptors  accepting  dif- 
ferent classes  of  languages  can  be  derived  from  special 

cases  of  augmented  transition  networks". 

A detailed  discussion  of  the  relationships  between 
transition  network  grammars  and  Chomsky's  hierarchy  can 
be  found  in  Ref.  II. 
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III.  Inference  for  Finite-State  Automata 

Finite  state  grammars  are  the  simplest  grammars. 
Host  of  the  questions  on  the  characteristics  of  this 
class  of  grammars  are  decidable  or  solvable.  Finite- 
state  languages  are  closed  under  union,  complement, 
and  Intersection.  If  Gj  and  Gj  are  finite-state 

grammars  generating  Lj  and  L2  respectively,  then 

there  Is  an  algorithm  to  determine  If  the  set 

(L,  A L2)  U (L,  A l2)  Is  empty.  If  It  Is  empty, 

Lj  and  l2  are  equivalent,  and  hence  Gj  and  G2  are 

equivalent.  There  are  a number  of  Inference  algorithms 
established  for  finite-state  grammatical  inference. 

Two  of  them  are  practical  In  Implementation  and  easy 
to  apply,  and  wilt  be  illustrated  here. 


I.  Derivative  grammars 

Definition  I:  The  formal  derivative  of  a set  of 

strings  A with  respect  to  the  symbol  a c V-  Is  defined 

. 12 

**  DaA  - (X|aXcA) 

The  canonical  derivative  finite-state  grammar^  Gj0 

associated  with  a positive  sample  S*  • {X. ,X.. ...iO  Is 
defined  as  follows: 

°CD  “ {VV  *•  °} 

a.  Let  U - (Uj,U2 ,Ur>  be  the  distinct  de- 

rivative of  S+  not  equal  to  X or  p where  X and  $ re- 
present the  null  string  and  the  empty  set  respectively. 

let  0,  - Dx  S+. 

b.  0 ■ U, 


d.  The  set  of  all  distinct  symbols  found  In  S Is 
VT 

e.  R Is  defined  as  follows: 

0j  * a Uj  If  and  only  If  Da  Uj  ■ Uj 

Uf  •*  a If  and  only  If  A c D#  Uj 

2.  *-Tal1  derived  grammars 

Definition  2:  A derived  grammar  Gp  ■ 

(Vu  , V_,  R_,  B ) Is  a grammar  generated  from  a canon I- 
N,.  7 VO 

0 6 
cat  grammar  G£  by  the  following  procedure  : 

a.  The  terminal  set  VT  Is  the  same  for  Cq  and  G^ 

b.  The  nonterminal  set  corresponds  to  a partition 
of  Vu  , where  V Is  the  set  of  nonterminals  of  G. 

WC  C 

c.  8_  Is  the  start  symbol  corresponding  to  the 

0 

block  In  the  partition  containing  a 

d.  Rq  is  defined  as  follows: 

1.  Bj  * a Bj  Is  In  Rp  If  and  only  If  there  exists 


2„.  2«  c V #ueh  tMt 


*«•  2-  e »:•  2„  c B. 


2.  Bj  - a Is  In  RQ  If  and  only  If  there  exists 

Za  t VN  such  that  Z0  ♦ a,  Z0  e B( 

C 

Definition  3:  Let  u - a^..^  c Vy  , and  let 

ACL(G).  The  K-tall  of  A with  respect  to  u is  de- 
fied as  g(u,A,K)  - {X|XeOuA,  |x|  <K). 

Blermann  and  Feldman  " have  proposed  a method  of 
applying  the  idea  of  K-tall  equivalence  to  partition 
the  nonterminal  set  of  a derivative  grammar  In  ob- 
taining the  derived  graenars.  Let  U(  and  Uj  be  two 

distinct  states  of  the  canonical  derivative  grammar 
Grn.  These  two  states  are  associated  with  the  de- 

r I vat  Ives  D„  $ and  Dv  S respectively  where  X.  and 

» J* 

Xj  are  sequences  from  . Uj  and  Uj  are  K-tall 


1 
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equivalent  If  and  only  If  g(X,,S+  ,K)  - g(Xj,S+  ,K). 

The  nethod  of  K-tatl  derived  grammars  Is  easy  to  apply. 
In  obtaining  a quick  rough  approximation  of  the  posi- 
tive semple  set,  this  method  Is  practical  and  useful. 

Since  a simple  transition  network  without  any 
pushdown  stores  and  registers  function  exactly  the 
same  as  a finite  automaton  does,  all  the  techniques  of 
finite-state  grammatical  Inference  can  be  applied  to 
the  Inference  of  simple  transition  network  grammars 
without  any  difficulty. 

IV.  Inference  for  Basic  Transition  Networks 

Context-free  grammars  are  more  complicated  than 
finite-state  grammars.  It  has  been  found  that  If  a 
context-free  grammar  Is  non-self-embedding,  then  the 
language  generated  Is  a regular  language.  It  Is  mainly 
this  self-embedding  property  that  distinguishes  a 
context-free  grammar  from  a regular  grammar.  Hence, 
most  Inference  techniques  for  context-free  grammars 
concern  the  revealing  of  the  self-embeddlngs  from  the 
sample  set.  The  theorem  concerning  self-embedding  of 
context-free  grammers  Is  stated  below: 

Theorem  I’' : For  any  context-free  grammar  1(G), 

there  exists  Integers  m and  n such  that  If  there  exists 
a string  Z In  1(G)  with  |z|  > n then  Z can  be  decom- 
posed to  the  form  Z ■ uvuxy,  where  vx  d X and  |vwx|  < 

m such  that  for  I > 0,  u v'  wx*  y Is  In  L(G). 

The  existing  Inference  techniques  for  context-free 
grammars  are  still  limited  to  some  specific  types  of 
context-free  grammars.  Also,  they  often  rely  on  heu- 
ristic methods  during  the  process  of  Inference.  Glps 
has  dascrlbed  a method  to  Infer  a pivot  grammar  which 

Is  an  operator  grammar  In  a very  restricted  form*. 
Crespl-Reghlzzl , through  the  use  of  a structured 
sample  set,  hes  developed  an  Inference  procedure  for 

K-dlstlnct  and  K-homogeneous  context-free  grammar'11. 

For  the  discovery  of  the  self -embedding  structures, 

Solomonoff  has  proposed  a strategy'*.  The  Idea  Is  to 
"guess"  the  elements  of  self-embedding,  "v,  u,  x," 
from  the  given  positive  sample  set.  This  heuristic 
method  Is  not  practical  In  Implementation.  However, 

It  suggests  an  Important  clue  for  later  research  In 
this  area.  Here,  we  will  present  a method  of  revealing 
the  self-eafceddlng  structures  using  the  Idea  of  formal 
derivatives. 

* r* 

Definition  kt  Let  v,  x c Z , and  Sri*-  , then 

V9_(S,K)  • {u|uc£*,  vuxcS  and  |w|<K) 

- Dy  C„( S.K) 

*d*ere  0y(S,K)  “ (wel*  |vucS  and  |w|<R) 

Es(S,K)  - (ucl*  fumes  and  |w|<K) 

Note  that  0^  (S.K)  - (S,K)  - X9X  (S,K)  - S. 

Suppose  that  the  grammar  G being  Inferred  1s 
context-free,  then  by  the  self -embedding  theorem,  for 

some  u,  v,  u,  x,  y,  uv'ux'y  Is  In  1.(6),  where  u,  v,  w, 
x,  y c I*,  v x d X and  I > 0.  This  Implies  that  for  a 
structurally  complete  positive  saa^lc  S*  and  for  some 

K,  that  K - |aa|  then  I • 1 (s\x)  3 <*,)  far  I - 

o.  ».  2.  ...  » * « y 

It  can  be  seen  that  the  substrings  of  (v'wx'|l>p) 


are  generated  by  a recursive  subnetwork  shown  In  Fig.  I. 
The  subnetwork  Is  a finite-state  automaton  which  can  be 
obtained  from  the  derivative  grammar  described  In 
Section  III  using  the  sample  set  S*  ■ (u.vAx),  where  A 

Is  the  name  of  the  subnetwork.  If  such  a subnetwork  Is 
found,  the  substrings  of  (v  ux  |l>0)  In-  S+  are  replaced 
by  a nonterminal  which  Is  the  name  of  the  subnetwork, 
l.e.,  A.  The  procedure  of  revealing  a recursive  sub- 
network Is  then  re-applled  to  the  new  sample  set, 
treating  the  nonterminal  symbol  as  a terminal.  The 
procedure  Is  repeated  until  no  more  recursive  sub- 
networks can  be  found.  Then  the  techniques  of  finite- 
state  automaton  Inference  described  In  Section  III  can 
be  applied  to  complete  the  network  of  the  sentences. 

Given  a sample  set  i* , the  procedure  of  Inferring 
a BTN,  such  that  L(BTN)  3 S*  Is  described  step  by  step 
as  follows: 

Step  I.  Construct  the  derivative  table. 

a.  Put  the  sample  set  S*  In  the  first  column, 
first  row. 

b.  List  all  the  nonempty  D (S+,n)  In  the  first 

vl 

column,  with  row  number  I - 2,),...,  where 
V|CE  , and  n Is  an  Integer  such  that  for  every 
X e S+.  I*l£"- 

c.  List  all  the  nonempty  E (S+,n)  In  the  first 

J 

row,  with  column  number  j - 2,3,...,  where 
Xj  c £+. 

d.  Let  the  element  of  the  table  at  column  j,  row 
I be  denoted  as  T|..  Complete  the  table  with 

T..  - ».  ,or  *t  J>2- 

IJ  V,  Xj 

Step  2.  Find  the  equivalent  classes. 


a.  For  K - 1,2,...,  list  the  equivalent  classes 
*V  t th*  derivatives  y 9 x (S*,K).  If 


vl*  J' 
such  that 


Is  In  UR  t then  for  any  v 

S > (*V) 


(s\k). 


vt" , *J"  ' V "J' 

v,„  • Xj„  Is  In  U^t.  If  there  are  L distinct 

equivalent  classes  for  K,  then  l ■ 1,2,.. .,1* 


b.  For  K ■ 1,2,...,  examine  UK  t,  I ■ t,2,...,L. 

If  a subset  of  U_  , Is  found  to  be  a subset  of 
the  sot 

(y  v»  « „l  yl«.  v,  x,  y c t* , vx  d X,  I > 0), 
go  to  Step  3,  otherwise,  to  Step  b. 


list  1-  Construct  a subnetwork  for  the  self-embedding 
a.  Since  C yl  • ^1  y|u,  v,  x,  y c £*, 
vx  d X,  t > 9),  substrings  v,  x are  found. 

It  y y«,  y * •"  Vl  •Meh  t,*t  ^ 

y u‘«  'x'lyc  »,  > V **• 


#f  V*  ** 


I 9 I 
u v'l  x'l  y 


(t^ylmp  1,  |wJ«K).  Form  a sample  sat 


J 


S*  i xl,>  W (vA*>  where  A Is  the 

new  of  the  new  subnetwork. 

b.  Construct  a derivative  finite-state  grammar 
for  S * using  the  procedure  described  In 
Section  III. 

c.  Obtain  the  subnetwork  from  the  derivative 
grammar.  Note  that  an  arc  In  the  TN  with  a 
nonterminal  as  Its  label  Is  a PUSH  arc. 

d.  Replace  the  substring  {v*  <t>m  x1  | I > 1,  m>o) 
In  S*  with  A. 

e.  So  to  Step  I. 

Stsfcb.  Construct  the  network  for  the  sentences. 

a.  Construct  the  derivative  finite-state  grammar 
for  t*. 

b.  Obtain  the  subnetwork  S from  the  derivative 

grammar. 

c.  The  name  of  the  subnetwork  S Is  the  start 
state  of  the  Inferred  BTN. 

d.  The  Inferred  BTN  Is  the  set  of  all  the  sub- 
networks Inferred. 

e.  STOP 

The  procedure  Is  easy  and  practical  to  Implement 
on  a computer.  Any  sample  set  with  a reasonable  size 
which  Is  large  enough  to  Imply  the  self-embedding 
structures  of  the  language  and  Is  not  so  large  as  to 
consume  up  the  computer  memory,  can  be  put  on  a com- 
puter to  Infer  Its  BTN. 

Example  It  Consider  a language  L - 


Example  ft  Cons 
{blab^bat>  Jt,k>|}. 


|«  I* 

_ The  substring  'ab  cb  a1  Is  em- 
bedded between  the  b's  In  the  sentence  and  the  symbol 
'c'  Is  embedded  between  the  b's  In  the  substring.  The 
language  could  be  the  encoded  strings  of  two  arms  of 
chromosomes  on  each  side  of  the  centromere  con- 
striction. This  can  be  shown  In  Fig.  2. 

A sample  set  of  size  35  which  Is  listed  In  Table  I 
Is  fad  to  the  computer  program  Implementing  the  In- 
ference procedure  as  stated.  After  the  table  of  the 
derivatives  Is  completed,  an  equivalent  class  of  value 
'c'  Is  found.  The  class  Is  shown  as  Table  2(a)  which 
Is  summarized  as  Table  2(b).  Examining  Table  2(b),  a 
so If -embedding  'e'  Is  found  with  substring  v ■ x • b. 

A subnetwork  for  the  sample  set  $A*  - (bcb.bAb)  Is  con- 
structed as  Fig.  3. 

Table  I.  The  Sample  Set  for  the  BTN  Inference  Experl- 


babbbbcbbbbab 

babbbbbcbbbbbab 

babbbbbbcbbbbbbab 

bbbabcbabbb 

bbbabbcbbabbb 

bbbabbbcbbbabbb 

bbbabbbbcbbbbabbb 

bbbabbbbbcbbbbbabbb 


bbabbcbbabb 

bbabbbcbbbabb 

bbabbbbcbbbbabb 

bbabbbbbcbbbbbab 

bbabbbbbbcbbbbbbabb 

bbbbabcbabbbb 

bbbbabbcbbabbbb 

bbbbabbbcbbbabbbb 

bbbbabbbbcbbbbabbbb 

bbbbabbbbbcbbbbbabbbb 


bbbbbabcbabbbbb 

bbbbbabbcbbabbbbb 

bbbbbabbbcbbbabbbbb 

bbbbbabbbbcbbbbabbbbb 

bbbbbabbbbbcbbbbbabbbbb 

bbbbbabbbbbbcbbbbbbabbbbb 


bbbbbbabcbabbbbbb 
bbbbbb  bbcbbabbbbbb 
bbbbbbabbbcbbbabbbbbb 
bbbbbbabbbbcbbbbabbbbbb 
bbbbbbabbbbbcbbbbbabbbbbb 


Table  2(a) . The  Equivalent  Class  of  Value  'c1. 


°bab  Ebab 
°babb  Ebbab 
Dbbab  Ebabb 
°babbb  Ebbbab 
°bbabbb  Ebbbabb 
°bbbabb  Ebbabbb 
°bbbbab  Ebabbbb 
°babbbbb  Ebbbbbab 


°bbabbbb  Ebbbbabb 
°bbbabbb  Ebbbabbb 
Dbbbbabb  Ebbabbbb 
Dbbbbbab  Ebabbbbb 
Dbabbbbbb  Ebbbbbbab 
°bbabbbbb  Ebbbbbabb 
°bbbabbbb  Ebbbbabbb 
°bbbbabbb  Ebbbabbbb 


Table  2(b).  Summary  of  Table  2(a). 

I < I < 6 


‘‘bab1  Eb'ab 

Vab'  Ev'.b2 
°bV  Eb'.b’ 
V.b«  Eb'.b3 

Vab'  Eb'.bS 


1 < I < 5 

I < I < A 

1 < I < 3 

I - I 


Replacing  substrings  (b  eb  |l>l)  In  the  sample  set 


of  Table  I by  the  nonterminal  symbol 
sample  set  shown  In  Table  3. 


we  get  a new 


Table  jj.  The  Sample  Set  After  Replacement. 

beAab  bbbbaAabbbb 


bbaAabb 

bbbaAabbb 


bbbbbaAabbbbb 

bbbbbbaAabbbbbb 


The  equivalent  class  of  the  derivatives  of  the 
new  sample  set  Is  shown  In  Table  A.  A self-embedding 
of  'aAa'  Is  found  with  v-x-b.  The  subnetwork  for  the 

sample  set  ■ {baAab,  bBb)  Is  constructed  as  Fig.  A. 

Replacing  substrings  (b*aAab*|  l>_l)  In  the  sample 
set  of  Table  3 by  the  symbol  'B',  we  get  the  set  (6). 
The  Inference  procedure  Is  completed  with  the  In- 
ferred transition  network  B shown  In  Fig.  5.  The 
language  generated  by  this  transition  network  is 

L - {blabkcbkabl|k,  t > I). 

Table  A.  The  Equivalent  Classes  of  the  Derivatives  of 
— the  Sample  Set  In  Table  3. 


•A* 

°baEab 
°bba  £abb 
°bbba  Eabbb 
°bbbba  Eabbbb 
°bbbbba  Eabbbbb 
°bbbbbba  Eabbbbbb 


•Aa' 

DbaEb 


Ebb 


bba 
°bbbaEbbb 
°bbbba  Ebbbb 
°bbbbba  Ebbbbb 
°bbbbbba  Ebbbbbb 


bbbabbbbbbcbbbbbbabbb  bbbbabbbbbbcbbbbbbabbbb 


V.  Conclusion*  and  Remark* 


• 

'*A' 

•Aab1 

°bE.b 

°bbEabb 

°baEX 
°bb*  Eb 

DbbbEabbb 

°bbba  Ebb 

°bbbb  Eabbbb 

°bbbba  Ebbb 

°bbbbb  Eabbbbb 

°bbbbba  Ebbbb 

°bbbbbb  Eabbbbbb 

Dbbbbbba  Ebbbbb 

■baA1 

'•A.' 

DXE.b 
Db  Eabb 

DbEb 
°bb  Ebb 

°bb  Eabbb 

Dbbb  Ebbb 

°bbb  Eabbbb 

Dbbbb  Ebbbb 

°bbbb  Eabbbbb 

°bbbbb  Ebbbbb 

°bbbbb  Eabbbbbb 

Dbbbbbb  Ebbbbbb 

An  analysis  of  CSL  In  terms  of  transformational 
grammars  can  be  found  In  Ref.  11.  A CSC  can  be  seen 
as  a CFG  (base)  and  a set  of  transformational  rules. 

The  CSL  Is  obtained  by  applying  a sequence  of  trans- 
formations to  the  CFL  generated  by  the  CFG.  For  a 
given  sample  of  CSL,  suppose  that  some  reverse  trans- 
formations are  assumed,  then  a sample  set  of  CFL  can 
be  obtained  by  applying  the  reverse  transformations. 

To  complete  an  ATN  for  the  CSL,  the  technique  of  BTN 
Inference  Is  used  to  construct  the  BTN  for  the  CFL  and 
then  augmented  arcs  are  added  to  achieve  the  trans- 
formations. It  can  be  seen  that  the  set  of  assumed 
reverse  transformations  plays  an  Important  role  In  the 
resulting  ATN.  The  system  of  ATN  Inference  may  be 
operated  under  a supervisor  in  the  fashion  of  trial  and 
error.  For  example,  the  sketch  of  an  ATN  inference 
system  shown  In  Fig.  6 may  be  a possible  solution. 

For  the  Illustration  of  the  ATN  Inference  technique, 
Example  2 Is  given  below. 

Example  2:  A sample  set  shown  In  Table  5 Is 
given  to  the  ATN  Inference  system  sketched  In  Fig.  8. 
A reverse  transformation  'bbc-*bcb'  to  the  strings  Is 
assumed.  The  sample  set  obtained  after  applying  the 
reverse  transformation  Is  shown  In  Table  6. 

Table  5 

Table  8 

•be 

•be 

a2b2c2 

.V)2 

.Vc3 

•3<bc)3 

AV 

•Nbc)* 

.Vc5 

•5(bc)5 

.Vc5 

•6(bc)6 

.Vc7 

•7(bc)7 

Now,  for  the  new  sample  set,  a BTN  shown  In  Fig.  7 
Is  Inferred  using  the  technique  described  previously  In 
this  Section.  To  complete  the  augmented  transition 
network,  two  augmented  arcs,  7 end  8,  are  added.  The 
resulting  ATN  Is  shown  In  Fig.  8.  It  Is  clear  that  tha 
given  sample  set  Is  a subset  of  the  language  generated 
by  the  Inferred  ATN. 


In  this  paper,  the  Inferences  of  transition  net- 
work grammars  are  presented.  In  particular,  a strategy 
to  reveal  the  self-embedding  structures  In  context-free 
lanaguages  has  been  proposed.  It  Is  found  to  be  a 
rather  systematic  and  practical  approach  compard  to  the 
existing  heuristic  methods. 

Although  the  research  of  finite-state  grammatical 
Inference  has  been  reasonably  successful,  the  research 
In  the  whole  area  of  grammatical  Inference  Is  still  In 
Its  Infancy.  In  many  cases,  finite-state  grammars  are 
not  powerful  enough  to  fully  characterize  the  language 
under  study.  Therefore,  more  complex  grammars  are 
needed.  The  subject  of  grammatlcel  Inference  for 
grammars  of  types  other  than  finite  state  Is  of  In- 
creasing Importance.  It  has  been  proven  thet  trans- 
formational grammars  are  es  powerful  as  type  0 grammars. 
The  close  relationship  between  transition  network  .. 
grammars  and  transformational  grammars  has  been  shown. 
Clearly,  a context-sensitive  grammar  can  be  represented 
as  a context-free  grammar  plus  a set  of  transformation 
rules.  These  can  be  fully  expressed  In  terms  of  basic 
transition  networks  with  a set  of  augmented  arcs.  It 
turns  out  that  the  basic  transition  networks,  which 
correspond  to  the  context-free  gremmar,  are  the  foun- 
dations of  grammars  of  different  complexities.  The 
Inference  of  basic  transition  network  grammars  becomes 
e key  to  the  area  of  grammatical  Inference.  Much  work 
remains  to  be  done;  however.  It  Is  hoped  that  more  In- 
terest and  research  will  be  stimulated  on  this  subject. 
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Fig.  6.  An  ATM  Inference  System 


Fig.  7.  The  Inferred  BTN 


Fig.  2.  A sequential  embedding  example 


Fig.  P.  The  Inferred  ATN 
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